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Abstract 

These notes focus on the applications of the stochastic Taylor expansion of solutions 
of stochastic differential equations to the study of heat kernels in small times. As an 
illustration of these methods we provide a new heat kernel proof of the Chern-Gauss- 
Bonnet theorem. 
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1 Introduction 



The purpose of these notes is to provide to the reader an introduction to the theory 
of stochastic Taylor expansions with a view toward the study of heat kernels. They 
correspond to a five hours course given at a Spring school in June 2009. 

In the first Section, we remind some basic facts about stochastic differential equations and 
introduce the language of vector fields. In the second Section, which is the heart of this 
course, we study stochastic Taylor expansions by means of the so-called formal Chen series. 
Parts of this section may be found in my book |3j , but the proofs given in these notes are 
different and maybe more intuitive. In the third Section, we focus on the applications of 
study stochastic Taylor expansions to the study of the asymptotics in small times of heat 
kernels associated with elliptic diffusion operators. In the fourth Section, we extend the 
results of the third Section, to study heat kernels on vector bundles and provide a new 
proof of the Chern-Gauss-Bonnet theorem. 



2 Stochastic differential equations: The language of vector 



In this section we remind some preliminary results and definitions that will be used 
throughout the text. We focus on the connection between parabolic linear diffusion equa- 
tions and stochastic differential equations and introduce the language of vector fields which 
is the most convenient when dealing with applications to geometry. 

For further reading on the connection between diffusion equations and stochastic differ- 
ential equations we refer to the book by Stroock and Varadhan [36J, where the proofs of 
the below cited results may be found. For further reading on vector fields we refer to the 
Chapter 1 of [39J and for more explanations on the use of the language of vector fields for 
stochastic differential equations, we refer to the Chapter 1 of |21j . 

2.1 Heat kernels 

A diffusion operator L on W 1 is a second order differential operator that can be written 



where 6, and a^- are continuous functions on W 1 such that for every x G M. n , the matrix 
(aij(x))i<ij< n is symmetric and positive. 

Associated to L, we may consider the following diffusion equation 



fields 




The function $ : [0, +oo) x R n 
f : W 1 -> R is the initial datum. 
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Under mild conditions on the coefficients b{ and a^- (for instance C°° bounded), it is well 
known that the above equation has one and only one solution It is usual to use the 
notation 

$(t,x)=F t f(x) 

to stress how the solution $ depends on the function /. The family of operators (Pt)t>o 
(acting on a convenient space of initial dats0) is called the heat semigroup associated to 
the diffusion operator L. The terminology semigroups stems from the following easily 
checked property 

Pt+s = P*Ps- 

The diffusion operator L is said to be elliptic at a point xq S M n if the matrix (ay(iEo))l<i,j<r, 
is invertible. We shall simply say that L is elliptic if it is elliptic at any point. 
If L is an elliptic diffusion operator, the heat semigroup associated to it admits the fol- 
lowing integral representation 

Pt/(x) = ( Pt (x,y)f(y)dy, t > 0, 

where p : (0, +00) x M. n x W 1 —* R is a smooth function that is called the heat kernel 
associated to L. 

2.2 Stochastic differential equations and diffusion equations 

Stochastic differential equations provide a powerful tool to study diffusion equations and 
associated heat kernels. Let us briefly recall below the main connection between these two 
types of equations. 
Let 

1 n d 2 n d 

1,3 = 1 J 4=1 

be a diffusion operator on W 1 . Since the matrix a is symmetric and positive, it admits a 
square root, that is, there exists a symmetric and positive matrix a such that 

a 2 = a. 

Let us consider a filtered probability space (0, (^t)t>o, J 7 , IP) which satisfies the usual 
conditions and on which is defined a n-dimensional Brownian motion (Bt)t>o- 
The following theorem is well known: 

Theorem 2.1 Let us assume that b and a are smooth, and that their derivatives of any 
order are bounded. 

Then, for every xq S 1" , there exists a unique and adapted process {Xf°)t>0 such that for 
t>0 

X?°=x Q +[ b(X*°)ds+ f a{X x s °)dB s . (2.1) 
Jo Jo 

In probability theory, it is common to work with the space of bounded Borel functions 
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Moreover, if f : M n —* R is a smooth and compactly supported function, then the function 

<Kt,x)=E(f(X?)) = P t f(x) 
is the unique bounded solution of the diffusion equation 

^{t,x)=L<f>(t,x), <f>(0,x) = f(x). 
2.3 The language of vector fields 

For geometric purposes, it is often more useful to use Stratonovitch integrals and the 

language of vector fields in the study of stochastic differential equations. 

Let O C R n be a non empty open set. A smooth vector field V on O is a smooth map 

V : O -» R" 

x -> (vi(x), ...,v n (x)). 

A vector field V defines a differential operator acting on the smooth functions / : O — ► R 
as follows: 

We note that V is a derivation, that is a map on C°°(0,R), linear over R, satisfying for 
f,g€C°°(0,R), 

V{fg) = (Vf)g + f{Vg). 

Conversely, it may be shown that any derivation on C°°(0,R) is a vector field. 

With these notations, it is readily checked that if Vo, V%, ■ ■ ■ ,Vd are smooth vector fields 

on R n , then the second order differential operator 

i=l 

is a diffusion operator. Though it is always locally true, in general, a diffusion operator 
may not necessarily be globally written under the above form. If this is the case, the 
operator is said to be a Hormander's type operator. We may observe as an easy exercise 
that the operator is elliptic if and only if for every x E R n , the linear space generated by 
the vectors V±(x), • • • , Vd(x) is equal to R n . 

To associate with L a stochastic differential equation it is more convenient to use Stratonovitch 
integration than Ito's. Let us recall that if (Xt)t>o and (Yt)t>o are two continuous semi- 
martingales, the Stratonovitch integral of Y against X may be defined by 

j YodX = f YdX + ^(X,Y), 

where {X, Y) denotes the quadratic covariation between X and Y . 
With this language, we have the following translation of Theorem 12.11 
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Theorem 2.2 Let (Bt)t>o be a d- dimensional Brownian motion. Let us assume that 
Vq,Vx, ■ ■ ■ ,Vd are smooth vector fields on 1", and that their derivatives of any order are 
bounded. 

Then, for every xq S W 1 , there exists a unique and adapted process (X^°)t>o such that for 
t > 

X? = x + f V (X*°)ds + J2f Vi(X?) o dBl (2.2) 
Jo i=1 Jo 

Moreover, if f : W 1 — > M is a smooth and compactly supported function, then the function 

0(t,s)=E (/(*?)) = P t /(x) 

is the unique bounded solution of the diffusion equation 

^(t,x) = L0(t,x), <f>(0,x) = f(x). 

where 

L = Vo + lj2v?. 

i=l 

The main advantage of this language is the following simple change of variable formula 
which is nothing else but the celebrated Ito's formula: 

Proposition 2.3 If f : W 1 — > R is a smooth function and (X^°) t >o the solution of h2.$fy . 
then the following ltd 's formula holds 

f(Xn = f(x ) + f V f(XT)ds + J2 f Vif(X?) o dBl 
Jo i=1 Jo 

3 Stochastic Taylor expansions 

Our goal is to study solutions of stochastic differential equations in small times. A powerful 
tool to do so is the stochastic Taylor expansion whose scheme is described in the first 
subsection below. After subtle algebraic manipulations of the stochastic Taylor expansion 
involving the so-called formal Chen series, it is possible to deduce an approximation in 
small times of the flow (x — > Xf)t>o associated to the given stochastic differential equation. 

3.1 Motivation 

Let / : R" -> K be a C°° bounded function and denote by (X?°) t >o the solution of (I2T2D 
with initial condition x G R n . First, by Ito's formula, we have 

f(Xf) = f(x) + £ l\vif){X* s ) o dBl t > 0, 
i=o Jo 
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where we use the notation B® = t. Now, a new application of Ito's formula to Vif(X* 
leads to 



f(X't) = f(x) + ^T(VJ)(x)Bl + I i\VjVif)(XZ) o dBi o dB\. 
i=i i, j= i J ° J ° 

We may iterate this process. For this, let us introduce the following notations: 

1. 

A k [0,t] = {(t 1 ,...,t k )e[0,t] k ,t 1 <... <t k }; 
2. If I = (ii, ...if.) £ {0, ...,d} k is a word with length k, 



[ odB 1 = [ odB\\ o ... o dB%, 

JA k [0,t] Jo<ti<...<t k <t 



and n(I) is the number of 0's in /. 
We can then continue the above procedure and get that for every N > 1 

N 

/(Xf) = /(x) + V V (V h ...V ik f)(x) odB 1 + R N (tJ,x), 



fc=i Je{o,...,d} fe ,fc+n(/)<JV 



for some remainder term Rjv(i,/, x) which is easily computed, and shown to satisfy 
sup VlE(R7v(t,/,x) 2 ) < CW*^ sup ||^ 1 ---y ife /|| 00 . 

xeK™ (i 1 ,...,i k ),k+n(I)=N+l or N+2 

This shows that, in small times, the sum 

TV 

/(*) + E E (Vn-ViJ)(x) odB 1 (3.3) 

fc=l /£{0,...,d} fe ,fc+n(/)<iV A fe [0,i] 

is a more and more accurate approximation of f(Xf) when N — ► +oo. 

Remark 3.1 For further details on the above discussion, we refer to Ben Arous |7|/ and 
Kloeden- Platen WM- Related discussions for the Taylor expansion of solutions of equations 
driven by fractional Brownian motions may be found in Baudoin-Coutin OV. For the case 
of rough paths we refer to Inahama \22$ and Friz- Victoir 
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3.2 Chen series 



Our goal is now transform the approximation given by the Taylor expansion (|3.3p into 
an approximation of the stochastic flow associated to the equation (|2.2[) . That is, at any 
order, we wish to construct an explicit random diffeomorphism such that 

- r 

(<S>?f)(x):=f(<S>?(x))=f(x)+J2 E (V h ...V ik f)(x) I odfl'+Rfrft/,*), 

k=l l£{0,...,d} k ,k+n(I)<N A k [0,t] 

for some remainder term ~R* N (t, f, x). 

In order to do so, the main tool was introduced by K.T. Chen in [12] in his seminal paper 
of 1957. Chen considered formal Taylor series associated to paths (or currents) and proved 
that such series could be represented as the exponential of Lie series. 
We present in this subsection those results. 

Let M[pfo, Xd\] be the non commutative algebra over R of the formal series with d + 1 
indeterminates, that is the set of series 

+oo 

Y = V0 + Y1 E aii,...,ik X h- X i k - 
k=l l£{0,l,...,d} k 



Definition 3.2 If x : M>o — > M d is an absolutely continuous path, the Chen series of x is 
the formal series: 

+ 00 y , 

6(x), = l + E E / dx\\---dxl 
k=i ie{o,i,...,d} k Vio<*i<...<i fc <t 

with the convention x® = t. 

The exponential of Y £ M.[[Xo, ...,Xd]] is defined by 

+ 00 yf. 

«p(y) = E fcf. 

fc=0 

and the logarithm of Y by 

+°° (_-\\k 
HY) = Y,-T L ( Y - 1 ) k - 

k=l 

The Chen-Strichartz formula that we will prove in this subsection, is an explicit formula 
for In &(x)t- 



jX h ---X ik , t>0, 
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Remark 3.3 As a preliminary, let us first try to understand a simple case: the commu- 
tative case. 

We denote Sk the group of the permutations of the index set {1, ...,k} and if a G Sk, we 
denote for a word I = (i\, i^), a ■ I the word (vm, If Xq, X\, were 

commuting, we would have 



+00 / r 

= E til-** [yJ2 / 

k=l I=(i u ...,i k ) \ 'aes k j/1 



dx*' 1 

A k [0,t] 



Since 



we get, 



E / dx^ = ^..4\ 



+00 - / d \ 

&{x) t = 1 + E fc? E = exp e x * x i ■ 

k=l ' I={ix,...,i k ) \i=0 / 

We define the Lie bracket between two elements U and V of R[[Xo, by 

[U, V] = uv - vu. 

Moreover, if / = (ii,...,ik) € {0, ...,d} k is a word, we denote by Xj the commutator 
defined by 

X I = [ X ii ,[Xi2> "j [-^ifc-u-^ifc]— ]• 

The universal Chen's theorem asserts that the Chen series of a path is the exponential of 
a Lie series. 

Theorem 3.4 [Chen-Strichartz expansion theorem] If x : M>o — ► M rf is an absolutely 
continuous path, then 

e{x) t = exp E E A K^)i^/ , * > 0, 
\fc>Ue{o,i,...,d}«= / 

where for k > 1, I E {0, 1, d} fc : 

• 5fc is t/ie set of the permutations of {0, k}; 

• If a £ Sk, e(cr) is i/ie cardinality of the set 

{j G {0,..,fc-l},ff(j)>«r(i + l)}, 



2 Rigorously, this means that we work in R[[Xo, Xi, • • ■ , where J is the two-sided ideal generated 

by the relations X%Xj — XjXi — 
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A I (x) t = \ ^ , / dxC {h) ---dxC {ik \ t>0. 

V e ( a ) J 

Remark 3.5 The first terms in the Chen-Strichartz formula are: 
1. 

d 

Ai(x) t X I = Y,4^; 

I=(h) k=0 



£ A I (x) t X I = ± Yl \ x » x i\ f 

I=(h,i 2 ) 0<i<j<d J ° 



xldx J s — x 3 s dx l s . 



We shall give the proof of this theorem in the case where the path x t is piecewise affine 
that is 

dxt = a-idt 

on the interval [ti, ij+i) where = to < t\ < ■ ■ ■ < tjy = T. Since any absolutely continuous 
path is limit of piecewise affine paths, we may then conclude by a limiting argument. The 
proof relies on several lemmas. 

Lemma 3.6 (Chen's relations) Let x t be an absolutely continuous path. For any word 
(ii, i n ) £ {0, 1, d} n and any < s < t, 

f dx^ 1 '-'^ = Y] f dx (il '-' ifc) / d x (ifc+i,-,<n) j 

where we used the following notations: 
1. 



f = f dx\\...dx\ k k - 

JA k [s,t] J s<ti<...<t k <t 



2. if I is a word with length 0, then J A or ot ] °dx I = 1. 
Proof. It follows readily by induction on n by noticing that 



f dx (il '-' in) = [ If ^(ii.-A-i) ] dx\ n . 

JA"[0,t] Jo WA«-i[0,t„] / 



The previous lemma implies the following flow property for the signature: 



□ 
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Lemma 3.7 Let xt be an absolutely continuous path. For < s < t, 

(+00 . N 

i + V V x h ...x ik / dx 1 

Proof. We have, thanks to the previous lemma, 

e(x) s (i + J2J2 x ^- x t* S dxI ) 

V til JAk[s,t] J 

=1+ Yl J2 x ^- x ^ x k- x <, / dxI / dxI ' 

kt^llj' JA"[s,t] JA"'[0,s] 

+00 . 



k=l I 

--&{x) t . 



□ 



With this in hands, we may now come back to the proof of the Chen-Strichartz expan- 
sion theorem in the case where xt is piecewise affine. By using inductively the previous 
proposition, we obtain 



N-l I +00 



1\ -L I T^AJ ~ 

&{x) T = n 1 + e e x n- x i k / L dxi 

n=0 V fc=l/=(i lv ..i fc ) JA k [t n ,t n+ i] 



Since, on [t n ,t n+ i] 
we have 



dxt = a n dt, 



JA k [t n ,t n+ l] JA k [t n ,t n + 1 ] 



(tn+1 ~ tn) k 

k\ ' 



Therefore 

N-l 



&(x)t=YI 1 + E E X h ...X ik 4---a n 



H 



k\ 

n=0 y k=ll=(i u ...i k ) 
N-l I d \ 

= n exp ( ^ n+i ~ e a « x * ) 

n=0 V i=0 / 

We now use the Baker-Campbell-Hausdorff-Dynkin formula (see Dynkin p3] and Strichartz 
[37]): 
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Proposition 3.8 (Baker-Campbell-Hausdorff-Dynkin formula) Ifyi,--- , yjy G M. d+1 
then, 

II exp I ^ Vn X i ) = ex P I X] X] ^ 7 ^ yi ' ' " ' I ' 

n=l \i=0 J \k>He{0,l,...,d} k ) 

where for k> 1, J G {0, 1, : 

0i{yi,--- ,vn) = 1^ / k _ l \ LL Vv '"V* ■ 

cres k o=j <h<---<j N -i<k 7i ! • • • 7jv_i !/c 2 ( , s | u=1 

V e ( (J ) / 

We get therefore: 

&(x) T = exp y~] ^ Piihao, ■ ■ ■ , (t N - tjv-i)ajv-i)X/ . 

\fc>i/ e {o,i,...,4 fc / 

It is finally an easy exercise to check, by using the Chen relations, that: 

0l(ha o ,--- ,(tN-t N -i)a N - 1 )= _! J \ L. ^dxl Hil) ---dxl Hih \ t>0. 



e cr 



0<ti<...<i fe <t 



Remark 3.9 T/ie seminal result of Chen JWjj asserted that 1ii&(x)t was a Lie series. 
The coefficients of this expansion were computed by Strichartz f37\/ . 

3.3 Brownian Chen series 

Chen's theorem can actually be extended to Brownian paths (see Baudoin [3], Ben Arous 
[7J, Castell [IT] . Fliess [15]) and even to rough paths (see Lyons [27], Friz-Victoir [TT]). 

Definition 3.10 If (Bt)t>o is a d-dimensional Brownian motion, the Chen series of B is 
the formal series: 

e(B) t = l+J2 £ / odBll ■ ■ ■ o dB$Z 

k=i ie{o,i,...,d} k ^ J o<ti<...<t k <t 

with the convention B® = t, and o denotes Stratonovitch integral. 

Theorem 3.11 If (Bt)t>o is a d-dimensional Brownian motion, then 

e(B) t = exp I £ £ M B )tXi I , t > o, 

\*>1 J6{0,l,...,d}* / 



jx^-.-x^, t>0, 
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where for k> 1, I G {0, 1, d} k , 



If 



e(a) 



0<ti<...<t fc <t 



y = 2/0 + E E a ii,-,ik X h- X ik- 
fc=l/G{0,l,...,d} fe 

is a random series, that is if the coefficients are real random variables defined on a proba- 
bility space, we will denote 

+oo 

E(Y) = E(y ) + E E Han,...,i k )Xn-X ik . 

k=l le{0,l,...,d} k 

as soon as the coefficients of Y are integrable, where E stands for the expectation. 

The following theorem gives the expectation (see Baudoin [3], Lyons- Victoir [28]) of the 

Brownian Chen series: 

Theorem 3.12 For t > 0, 



E (&(B) t ) = exp (x + \ E X^j j 



Proof. An easy computation shows that if T n is the set of words with length n obtained 
by all the possible concatenations of the words 

{0},{(M)}, i€{l,...,rf}, 

1. If J £ X n then 

E ( /" odB 1 ) = 0; 
y7A«[o,t] J 

2. If / € l n then 

n + n(7) 



E / odB 



i 



A»[o,t] y 2— , — - 



li^iii / n+n(I) V 



where n(/) is the number of in / (observe that since I E I n , n and n(I) necessarily 
have the same parity). 



Therefore, 



+ 00 k + n(I) 

t 2 



E (S(B) t ) = 1 + V y . ... -. ^X h ...X h 



k=i I eJ k z 1 \ — 2 



□ 
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3.4 Exponential of a vector field 

With these new tools in hands we may now come back to our primary purpose, which was 
to transform the stochastic Taylor expansion into an approximation of the stochastic flow 
associated to a stochastic differential equation. 

In order to use the previous formalism, we first need to understand what is the exponential 
of a vector field. Let O C W 1 be a non empty open set and V be a smooth vector field 
on O. It is a basic result in the theory of ordinary differential equations that if K C O is 
compact, there exist e > and a smooth mapping 

$ : (-£,£) xif^O, 

such that for x G K and — e < t < e, 

— {t,x) = X(<f>(t,x)), $(0,x) =x. 

Furthermore, if y : (— 77,77) — > W 1 is a C 1 path such that for — 77 < t < 77, = X(y(t)), 
then = <I>(t,y(0)) for — min(r?, e) < t < min(?7, e). From this characterization of <I> it 
is easily seen that for x G K and t±,t2 £ M such that | ii | + | £2 |< £j 

$(ti,$(t 2 ,a;)) = *(ti + t2,a0. 

Because of this last property, the solution mapping t — ► 3>(i, x) is called the exponential 
mapping, and we denote x) = e tv (x). It always exists if | t \ is sufficiently small. If 
e tv can be defined for any f El, then the vector field is said to be complete. For instance 
if O = W 1 and if V is C°°-bounded then the vector field V is complete. 

3.5 Lie bracket of vector fields 

We also need to introduce the notion of Lie bracket between two vector fields. 

We have already stressed that a vector field V may be seen derivation, that is a map on 

C°°(0,R), linear over M, satisfying for f,g G C°°(0,R), 

V(fg) = (Vf)g + f(Vg). 

Also, conversely, any derivation on C°°(0,R) is a vector field. If V' is another smooth 
vector field on O, then it is easily seen that the operator VV' — V'V is a derivation. It 
therefore defines a smooth vector field on O which is called the Lie bracket of V and V' 
and denoted [V, V 1 }. A straightforward computation shows that for x G O, 




Observe that the Lie bracket satisfies obviously [V, V] = —[V, V] and the so-called Jacobi 
identity, that is: 

[V, [V, V"}] + [V, [V", V}] + [V", [V, V']} = 0. 
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3.6 Castell's approximation theorem 

Combining the stochastic Taylor expansion with the Chen-Strichartz formula leads finally 
to the following result of approximation of stochastic flows which is due to Castell 

Theorem 3.13 (Castell approximation theorem) Let (Bt)t>o be a d- dimensional Brow- 
nian motion. Let us assume that Vq, Vi, ■ ■ ■ ,Vd are C°° bounded vector fields on W 1 , Then, 
for the solution (X^°)t>o of the following stochastic differential equation 



X?° = x + f V (X*°)ds + Y,f Vi(X*°) o dBi 
Jo i=1 Jo 



(3.4) 



we have for every N > 1 , 

Xf° = exp j S Aj(B) t Xj ] (so) + t^TL N (t), 

\k=l Ie{0,l,...,d} k ,k+n(I)<N J 

where n{I) denotes the number of O's in the word I and where the remainder term TLj\r(t) 
is bounded in probability when t — > 0. More precisely, 3 a, c > such that \/A > c, 

( ^+1 , /XI w\ / A a 

mar sup s 2 | H N (s) \> At 2 < exp 

*-^o \0<s<t J \ c 

Remark 3.14 Kunita, in I23\j . proved an extension of this theorem to stochastic flows 
generated by stochastic differential equations driven by Levy processes. 

4 Approximation in small times of solutions of diffusion 
equations 

In this section, we now turn to applications of the previous results on the stochastic Taylor 
expansion to the study in small times of parabolic diffusion equations. 
We consider the following linear partial differential equation 

— = £$, $(0,s) = f(x), (4.5) 
where L is a diffusion operator on M. n that can be written 

z »=i 

the V^'s being smooth and compactly supported^ vector fields on W a . It is known that the 
solution of (14.51) can be written 



^(t,x) = (e tc f)(x) = P t f(x). 



3 This assumption will not be restrictive for us because we shall eventually be interested in local results 
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For instance 



and 



If I G {0, 1, d} k is a word, we denote as before 

V I = [V il ,[V i2 ,...,[V ik _ 1 ,V ik }. 

and 

d{I)=k + n{I), 

where n(I) is the number of 0's in the word /. 
For N > 1, let us consider 

Pf = Elexp( Yl A dB)tV T 

\ \I,d(I)<N 

P^E^exp 

P?=E(exp(^^ + i £ f ' BldBi - BidBUVuVj] 

\ \i=0 l<i<j<d , 

The meaning of this last notation is the following. If / is a smooth and bounded func- 
tion, then {Pf f){x) = E(\I/(l,x)), where \P(t,x) is the solution of the first order partial 
differential equation with random coefficients: 

§^(r,z)= E A i( B )t(Vi^)(r, x), *(0,x) = f(x). 

I,d(I)<N 

Finally, let us consider the following family of norms: If / is a C°° bounded function, then 
for k > 0, 

II / ||fe= sup sup sup || V h ■■■V i J(x) || . 

0<l<kO<h,- ,ii<dxeR n 

Theorem 4.1 Let N > 1 and k>0. If f is a C°° bounded function, then 

II P t / - Pf/ IU= o (t^) , *-o. 

Proof. First, by using the scaling property of Brownian motion and expanding out the 
exponential with Taylor formula we obtain 



expf E mW)/=[E^( E < 

\I,d(I)<N / \k=0 ' \I,d(I)<N 
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where the remainder term R]y(t) is such that E (|| R]y(i) ||fc) is bounded when t — ► 0. 
We now observe that, due to Theorem 13. 11\ the rearrangement of terms in the previous 
formula gives 



Em( E '«) E I, 

yfe=0 \I,d(I)<N J J I,d(I)<N J ^ 



odB I V H ...V i , Il f + t E ^B 2 N (t), 

\[o,t] 



where E (|| Rjy-(i) \\k) is bounded when t — ► 0. Therefore 



exp 



and 



( MB)tVi\f = f+ E I odB I V il ...V iw f + t^B? N (t), 

\I,d(I)<N ) I,d{I)<N 

where E (|| R^(i) ||fc) is bounded when t — ► 0. We now recall (see the proof of Theorem 
13.121 ) that if I n is the set of words with length n obtained by all the possible concatenations 
of the words 

{0}, {(*,»)}, ie{l,...,d}, 



1. If 1 iX n then 



2. If / G l n then 



E | / odB 1 = 0; 
A™[0,t] / 



E / odB 



l 



n+n(I) 



( n+n(I) Y 



A n [0,t] ) 2 — 2 

where n(I) is the number of in / (observe that since I S I n , n and n(I) necessarily 
have the same parity). 



We conclude therefore 



fc<^±± 



Since it is known that 

up,/- E ^/iu=o(t^), 
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the theorem is proved. □ 

The previous approximation theorem may be used to obtain small times heat kernel asymp- 
totics. From now on, we assume furthermore that at given point xq G M n the vector fields 
Vi(xq), Vd(xo) form a basis of W 1 which of course implies n = d. In that case, it is 
known that the random variable Xf° admits a smooth density Pt(%0i ■) with respect to the 
Lebesgue measure of W 1 . In other words, 

P(Xf° edy)= Pt (x ,y)dy, 

for some smooth function p(xq, •) : (0, +00) x M. n — > M>o- 

We are interested in pt(xo,xo) in small times. It is known (see for instance Azencott, pQ), 
that the following asymptotic expansion holds when t — > 0, 



Vfc=0 / 



Pt(x ,x ) = - I > ' a k (x )t* I +0( t N+1 -f) , N > 0, 



for some constants ao(xo),-"" ) a iv(^o)- The following proposition provides an effective 
way to compute these constants. 

Proposition 4.2 For N > 1, ui/ien £ — ► 0, 

Pt{xo,x ) = d t (x ) + O [t 2 I, 

where d^(xo) is the density at 0/ £/ie random variable d(/)<Af ^(^)*^( x o) 

Proof. This is a particular case of Theorem 15.21 which is proven below. □ 

Remark 4.3 This result may also be found in \2S§ . 



For instance, by applying the previous proposition with N = 1, we get 

a ° (X0) = ( 27 r)£|det(Vi(xo),-" ,^(x ))| 

The computation of ai(x) is technically more involved and relies on the so-called Levy's 
area formula. In order to simplify the following computations we shall make some as- 
sumptions that avoid heavy computations, however the described methodology may be 
extended to the general case. First we assume Vb = 0. 

We wish to apply the previous proposition with N = 2. For that, we need to understand 
the law of the random variable 



e t = Y J BiV t (x ) + ± Yl f BidBi-BidB, 

i=l l<i<j<d 



[Vi,Vi](x ). 
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Since L is assumed to be elliptic at xq, we can find ufj such that ufj = —^ji and 

d 

[V i ,V j ](x ) = Y,"ij V k(xo). 
k=i 

Our second assumption is the following skew-symmetry propertjQ uifj = — With these 
notations, we therefore have 

9 t = Wi^ + ~ 4 f BldBi-BidBl\v k (x ). 

k=l \ l<i<j<d / 

By a simple linear transformation, we are reduced to the problem of the computation of 
the law of the M. d - valued random variable 



\ l<i<j<d 



t 

BidB j s - BidBi 



Kk<d 



It is known from the Levy's area formula that if A is a d x d skew-symmetric matrix, then, 
for t > 0, 

E (j&AB M) i Bt = x } = det 1 exp ^ I-tAcottA ^ ^ 

Therefore, for A E M d , 



where 

d 



2 

k=l 



Thus, from the inverse Fourier transform formula, the density of 6t with respect to the 
Lebesgue measure is given by 

According to Proposition 14.21 we are interested in 

4 This assumption, together with Vb = 0, for instance holds when L is the Laplace-Beltrami operator 
on a compact semi-simple Lie group 
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and wish to perform an asymptotic development when t — > 0. By using the standard 
Laplace method, we are led to 



Qt 



which implies therefore: 

p t (xo,x Q ) 



det(Vi(x ),--- ,V d (x ))\ (27rt) 



Remark 4.4 At that time, up to the knowledge of the author, a similar method may not 
be achieved to compute for instance 02 (xq). This is due to the fact that the law of the 
random variable 

Aj(S) t Vr(ar ) 

I,d(I)<N 

is poorly understood when N > 3. However, Proposition \4-%\ makes explicitly appear the 
geometric information contained in the coefficients ai(xo). 



5 Elliptic heat kernels asymptotics on vector bundles and 
the Chern-Gauss-Bonnet theorem 

The goal of the present section is to extend the above results to the study of elliptic heat 
kernels on vector bundles. In particular, we will see that the methods that we developed 
are sharp enough to recover the celebrated heat kernel proof of the Chern-Gauss-Bonnet 
theorem. The relevant geometric quantity involved in this theorem is hidden very far in 
the asymptotic development of the heat kernel but will appear in a straightforward way 
by using stochastic Taylor expansions. 

Our first goal is to adapt what we did before, to vector bundles on compact Riemannian 
manifolds. We assume here from the reader some basic knowledge on Riemannian man- 
ifolds, vector bundles and linear connections, as may be found in any graduate textbook 
on differential geometry. 



5.1 Elliptic heat kernels asymptotics on vector bundles 

Let M be a (i-dimensional compact smooth Riemannian manifold and let £ be a finite- 
dimensional vector bundle over M. We denote by r(M, £) the space of smooth sections of 
this bundle. Let now V denote a connection on E. 
We consider the following linear partial differential equation 

^ = £$, $((),*) = /(*), (5.6) 
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where C is an operator on £ that can be written 

d 



1=1 
with 

V< = ^ + Vv i , 0<i<d, 

the Vi's being smooth vector fields on M and the ^i's being smooth potentials (that is 
smooth sections of the bundle End(£ )). As in the scalar case, it is known that the solution 
of (15. 6ft can be written 

$(t,x) = (e tC f)(x) = P t f(x), 
where Pj is a contraction semigroup of operators. If I G {0, 1, d} k is a word, we denote 

V/ = [Vi 1 ,[Vi 2 ,... ! [Vi fc _ 1 ,ViJ...]. 

and 

d(I) =k + n(I), 

where n(I) is the number of in the word /. 
For N > 1, let us consider 

Pf = E I exp I Yl A d B )tVi 

As before for the scalar case, the meaning of this last notation is the following. If / G 
r(M,£), then (Pf f)(x) = E(^(l,x)), where *(r,x) is the solution of the first order 
partial differential equation with random coefficients: 

^(r,x)= Yl Aj(S) t (V/*)(r,x), *(0,x) = /(x). 

I,d(I)<N 

Let us consider the following family of norms: If / G r(M, £), for k > 0, 
|| / \\ k = sup sup sup || Vjj • • • V»j/(sc) || • 

0<Z<fcO<ii,-,ii<da:eM 

We have the following extension of Theorem 14.11 which may be proved in the very same 
way. 

Theorem 5.1 Let N > 1 and k>0. For f G r(M,£), 

||p t /-pf/IU=o(V^V t-o. 
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Let us now assume that the operator L is elliptic at xq G M in the sense that (Vi(xo), Vd(xo)) 
is an orthonormal basis of the tangent space at xq. In that case, Pt is known to admit a 
smooth Schwartz kernel at xq. That is, there exists a smooth map 

p( % -):%o-r(M,Hom(f)) 

such that for / G T(M, £), 

(Pt/)(x ) = / p t (x ,y)f(y)dy. 
Jm 

Theorem 5.2 Let N > 1. There exists a map 

jA(x 0) -):K>o->r(M,Hom(£)) 

suc/i that for f £ T(M,£), 

(Pf/)(x )= / P?(x ,y)f(y)dy. 
Jm 

Moreover, 

Pt(xo,x ) = Pt{xQ,X ) + (t^iH^ . 

Proo/. 

The proof is not simple. We shall proceed in several steps. In a first step, we shall show 
the existence of a kernel at xq for acting on functions. In a second step we shall deduce 
by parallel transport, the existence of p N (xq, •). And finally, we shall prove the required 
estimate. 



First step: 

Let us define, 

Qf = Ejex P l £ ki{B) t Vi 
\ \/,d(J)<JV 

In order to show that admits a kernel at xo, we show that for t > 0, the stochastic 
process 

Zf = exp( £ A 7 (5) t Vr] 0r„) 
\W)<iV / 

has a density with respect to the Riemannian measure of M. To this end, from the well- 
known criterion of Malliavin (see [30], [31]), we show that the Malliavin matrix of Zf is 
invertible with probability one. A sufficient condition for that, is 

rojzf, i = i,...,d, 
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forms a basis of the tangent space at xq where W denotes the i — th partial Malliavin's 
derivative taken at time 0. An easy computation shows that 

DjZf = Vi(x ), t > 0. 

Our ellipticity assumption gives therefore the existence of q N (xo, •) : R>o x M — > R>o, 
such that for every smooth / : M — > R, 

(Q?f)(xo)= I q?(xo,v)f(v)dy. 
Jm 



Second step: 

For t > 0, let us consider the operator Q^(xq) defined on r(M, £) by the property that 
for j) e T(M,£) and y G O xo , 



(9f(x )r ? )(y)=E 



exp[ Yl A/(^)tV/)r? 

I,d(I)<N 



Oo) 



exp ( £ Aj(B) t Fj ] (x ) = y 

I,d{I)<N 



We claim that G^(xo) is actually a potential, that is a section of the bundle End(£ ). For 
that, we have to show that for every smooth / : M — ► R and every 77 £ T(M, £ ), y G C^, 

(0f (x )/i7)(y) = /(y)(ef (x )»7)(y). 

If / is a smooth function on M, we denote by JVtf the operator on r(M, £) that acts by 
multiplication by /. Due to the Leibniz rule for connections, we have for any word /: 



[V I ,Mf]=M Vl f. 



Consequently, 



M 



Ai(B) t Vi,M f 

I,d(I)<N 

The above commutation property implies the following one: 



E/, d( /)<ivA/(B)tV//- 



exp A i( B )tVi \ M f = M 



J,d(I)<N 
Therefore, 



exp(E/ 



d(I)<N ' 



,(B)^)/ exp ^ Aj(B) t Vj . 



.I,d(I)<N 



[e?(x ),M f ] = o, 



so that 0^(xo) is a section of the bundle End(£). We can now conclude with the disin- 
tegration formula that for every 77 G T(M,£), 



(Pf,)(x»)=/pf(xo,^W* 
Jm 



22 



with 



Final step: 

Let us now turn to the proof of the pointwise estimate 

AT/ \ ( N + 1 ~ d \ 

Pt{xo,x ) =p t {x ,x ) + [t 2 j, t -> 0. 

Let y £ M be sufficiently close to xq. Since C is elliptic at xo, it is known (see for instance 
Chapter 2 [8]) that pt(xo,y) admits a development 

^ = ^TTt^/a ( S ^ fe ^ ' ^ + ^ R iv(*, x , y) J , (5.7) 

where the remainder term Hpf(t,Xo,y) is bounded when t — > 0, ^(xo,-) is a section of 
End(£) defined around xq and cZ(-, •) is the distance defined around xo by the vector fields 
Vi, Vrf. By using the fact that for every smooth / : M — ► R, 

(Qf /)(x ) = E ^/ ^exp ( £ A,(B) t y^ (x ) j j , t > 0, 

and classical results for asymptotic development in small times of subelliptic heat kernels 
(see for instance [6] and Chapter 3 of [3] ), we get for q^(xQ,y) a development that is 
similar to (|5.7p . For Q^(xq), the scaling property of Brownian motion implies that we 

k 

have a short-time asymptotics in powers tz , k £ N. Since, 
we deduce that 

P?{xo,y) = 6 ( 27Tt y/2 \ yi^k{xo,y)ti +t E ¥-R N (t,x ,y)J , 

where the remainder term Rjv(i, XQ,y) is bounded when t — ► 0. With Theorem 14. we 
obtain that = ^fc> k = 0, N, and the required estimate easily follows. □ 

Remark 5.3 The question of the smoothness ofp^ is not addressed here. It would require 
bounds on the inverse of the Malliavin matrix of . 

From the previous theorem, we deduce an explicit asymptotic expansion of pt(xo,xo). If 
I G {0, 1, d} k , k > 2, is a word, we denote 

Ti = Vj - V Vl E r(M,End(£)). 
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Corollary 5.4 For N > 1, when t — * 0, 
Pt(xo,£o) = (x )E exp ^ A I (B) t J r I (x ) 



Y A I (B) t V I (x ) = ] +0(t 

I,d(I)<N 



where d^ (xq) is the density at of the random variable Y^i d(i)<N ^i(B)tVi(xo)- 

Proof. Let us first observe that for the same reason than in the proof of step 1 of the 
above theorem, the random process 

Y Aj(S) t Vr(ar ) 

I,d(I)<N 

has a density d^(xo,-). Therefore, due to the disintegration formula, for every smooth 
•q 6 r(M,£), 

Y Ai(B) t V I (x ) = o\ d?(x , 

I,d(I)<N J 



(Pf n)(x ) = I Elexpf Aj(S) t V/)77(x ) 

^T, M y \W)<JV / 



and the proof follows by letting r] converge to Dirac distribution at xq. □ 

The previous theorem may be used to prove various local index theorems. We focus here on 
the simplest one which is the Chern-Gauss-Bonnet theorem. A proof of the Atiyah-Singer 
local index theorem for the Dirac operator on spin manifolds can be found in [5]. 
For alternative proofs that use stochastic differential geometry, we refer to [S] and |21| . 

5.2 The Chern-Gauss-Bonnet theorem 

Let M be a d-dimensional Riemannian, compact, smooth manifold. The Chern-Gauss- 
Bonnet theorem proved by Chern [13] in 1944 is the following: 

Theorem 5.5 Let x(M) be the Euler characteristic o/M. If d is odd, then x(M) = 0. // 
d is even then 

X(M) = / uj(x)dx, 



where u(x)dx is the Euler form, that is the volume form given in a local orthonormal 
frame ej by 

(_lW2 d-i 

u = (87r)d/2(d/2)l ^ £ ( <r ) f ( T )II R »( i )»( i + 1 )#( i + 1 ) (l1 ' 

where T,^ is the set of the permutations of the indices {l,...,d}, e the signature of a 
permutation, and 

Rijki = (R(ej,ek)ei, et) , 
with R Riemannian curvature o/M. 
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The striking feature of Chern-Gauss-Bonnet theorem that makes it so beautiful is that 
the Euler form depends on the Riemannian metric whereas x(M) is only a topological 
invariant. We now turn to a short proof of it that uses the tools we developed in these 
notes. In the sequel, we always assume that the dimension d is even. 

We first briefly recall some basic facts on Fermion calculus on the Clifford exterior algebra 
of a finite dimensional vector space, as can be found in Section 2.2.2 of [M] and that will 
be used in our proof. Let V be a d-dimensional Euclidean vector space. We denote V* its 
dual and 

av* = 0a¥, 

k>0 

the exterior algebra. If u G V* , we denote a* the map AV* — ► AV*, such that a*(w) = 
uAuj. The dual map is denoted a u . Let now 8%, 6^ be an orthonormal basis of V* . We 
denote <ij = a$ i . We have the basic rules of Fermion calculus 

{ai,aj} = 0,{a*,a*} = 0, {di,a*} = (%, 

where {•,•} stands for the anti-commutator: {a,i,aj} = cuaj + ajOj. If I and J are two 
words with 1 < i\ < • • • < i k < d and 1 < j\ < • • • < ji < d, we denote 

A IJ = a* i ---a* k a jl ---a jr 

The family of all the possible Ajj forms a basis of the 2 M -dimensional vector space 
End (AT/*). 

If A = Y^iJ c IjAu £ End (AV*), we shall say that 

If A € End (AV*), we define the supertrace Str(A) as the difference of the trace of A on 
even forms minus the trace of A on odd forms. 

One of the the interests of Fermion calculus (which is equivalent to Clifford calculus) is 
that it makes easy to compute supertraces: If A = Y2i j cijAjj, then 

Str(A) = (-l) d -^c {l _ d}{l _ d} . (5.8) 

We now carry the Fermionic construction on the tangent spaces of our manifold M. Let 
be a local orthonormal frame and let 6i be its dual frame. The curvature endomorphism 
is defined by 

T = — Rijkia*aja* k ai 

ijkl 

where 

Rijki = (R(ej,e k )ei,ei) , 

with R Riemannian curvature of M. This definition is actually intrinsic, i.e. does not 
depend on the choice of the local frame. In this setting, the celebrated Weitzenbock 
formula reads 

□ = A + T, 
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where □ = dd* + d*d is the Hodge-DeRham Laplacian and A the Bochner Laplacian. Let 
us recall that if a is a local orthonormal frame, we have the following explicit formula for 
A: 

d 

A = -^(V ei V ei -Vve i e i ), 
i=l 

where V is the Levi-Civita connection. 



After these preliminaries, we can now turn to the proof of Chern-Gauss-Bonnet theorem. 
From now on, we suppose that the dimension d is even. The first crucial step is McKean- 
Singer formula [29] (A simple proof of it can be found in [34J page 113). We have 



/Str pt{x, x)dx, t > 0, 



where = e~* D and pt is the corresponding Schwartz kernel (density). In other words, 
the supertrace is constant along the heat semigroup associated with the Hodge-DeRham 
Laplacian and this constant is equal to the Euler characteristic. 
An easily proved and non surprising precise statement is the following: When t — > 0, 

1 T 

sup || p t (x,x) T e (x) \\= O 

xeM (47rf)2 



fd/2-l 



But as seen in the next proposition, due to Corollarv l5.4l fantastic and subtle cancellation^ 
occur at the paths level when we take the supertrace: 

Proposition 5.6 For every x £ M, 

limStr pt(x,x) =lo(x). 

Proof. Let x$ £ M be fixed once time or all in the following proof. We work in a 
synchronous local orthonormal frame around xq, that is Vej = at xq. At the point 
xq, we have 



d 
i=l 

and therefore 

d 

We want to apply Corollary 15.41 in the present framework. 

We denote JF = —\T, Ti = 0, 1 < i < d and if I £ {0, 1, d} k is a word, 

^/ = [v J1 ,[v 42 ,...,[v lfc _ 1 ,v J j...]-v [eiiJe!2i ... i[eifc _ reiJ ... ] Gr(M,End(Ar 



5 We quote here McKean- Singer |29| who conjectured these cancellations of terms that involve many 
covariant derivatives of curvature terms. 
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with the convention eo = 0, Vo = Fq, Vj = V ej , 1 < i < d. According to Corollary 15.41 
we thus have for N > 1, and t — > 0, 

Ptftixo, x ) = qf(x )M exp ^ Aj(B) t J r i(xo) Yl A i( B )tei(x ) = +O (r^) 

\ \/,(i(-0<JV / I,d{I)<N J 



where (xq) is the density at of the random variable J2i d(i)<N A i( B )t e i( x o)- Applying 
this when N = d gives 



^ A I (B) t e I (x ) = o\+o(ytj 

I,d{I)<N J 



Pt/2(xo,x ) = qf(x )E exp ^ A/(S)t^/(x ) 

\ \-f,d(/)<d 

By using the scaling property of Brownian motion, it is easily seen that for k > | + 1, 
/ 



q?(x )E 



\ 



Ai(B) t Fi(x ) 

K I,d{I)<d 



MB)tei(x ) = ) =o(Vt) • 
I,d(I)<N J 



Therefore 



p t / 2 (xo,x ) = qf(x )E 



( d/2 I I \ 
E^ E Az(B)^/(x ) 

. fc=0 \/,(i(/)<d / 



£ Aj(S) t ej(x ) = ] +0 (Vt' 
I,d(I)<N 



and 



Str p t / 2 (sco,»o) = 9t (»o)E 



E^i Str ( E AK^Kxq)] ^ A I (B) t e I (x ) = o\+o(yi) 

yk=0 ' \I,d(I)<d J I,d(I)<N / 



Since a routine computation shows that ^ 7 MI)<d A i{ B )t^i{xo) may be written as a linear 
combination of terms a*dj, a*aja* k ai, due to the formula (|5.8|) . we have for k < | — 1, 



Str ( Aj(B)*Fj(a;o) J =0. 

vl,d(l)<d 



Consequently, 



Str p t / 2 (sco,x ) = <? t d (x )IE 



-1-Str ( £ AjCS)^/^) 



£ Aj(B) t ei(x ) = 0]+O (7* 

I,d(I)<N 



27 



By using again the scaling property of Brownian motion, we have 
I ■ 

qf(x )E 



J^Str ( A i( B )tfi(x ) 

I,d(I)<d 



(d/2) 



A I {B) t e I (x ) = 

I,d(I)<N 



We finally end up with 

Str p t /2(x ,x ) 
which proves that 



A I (B) t e I (x ) = 0\ +0(Vt 

I,d(I)<N 



' dq?(x )Str F (x )% +o(Vi 



(d/2)! 



1)^ , ^ 

Str^(x )2. 



lim Str pt(xn, %o) - , 
t^o m u; (d/2)!(4vr) d / 2 

Heavy, but straightforward computations (see [33], Lemma 2.35) show that 

(_l)d/2 



(d/2)!(4vr) d / 2 



Str ^(xq) 2 = lv(xq) 



□ 
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